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Abstract 

Analyzing the mixing time of random walks is a well-studied problem with applications in 
random sampling and more recently in graph partitioning. In this work, we present new analysis 
of random walks and evolving sets using more combinatorial graph structures, and show some 
implications in approximating small-set expansion. On the other hand, we provide examples 
showing the limitations of using random walks and evolving sets in disproving the small-set 
expansion hypothesis. 

1. We define a combinatorial analog of the spectral gap, and use it to prove the convergence 
of non-lazy random walks. A corollary is a tight lower bound on the small-set expansion 
of graph powers for any graph. 

2. We prove that random walks converge faster when the robust vertex expansion of the graph 
is larger. This provides an improved analysis of the local graph partitioning algorithm using 
the evolving set process. 

3. We give an example showing that the evolving set process fails to disprove the small-set 
expansion hypothesis. This refutes a conjecture of Oveis Gharan and shows the limitations 
of local graph partitioning algorithms in approximating small-set expansion. 
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1 Introduction 


Analyzing the mixing time of random walks is a fundamental problem with many applications in 
random sampling [LPW08]. The evolving set process is an elegant tool introduced by Morris and 
Peres [MP05] to provide sharp analyses of mixing time (see the survey [MT06]). Recently, random 
walks and the evolving set process have also been used in designing local algorithms for graph 
partitioning [ST13, ACL06, AP09, OT12, KL12]. The evolving set process is the most powerful 
among the local graph partitioning algorithms, and Oveis Gharan [Ovel3] even conjectured that it 
can be used to disprove the small-set expansion hypothesis [RSIO]. A common theme of this paper 
is to study the power and limitations of this technique, from analyzing mixing time to local graph 
partitioning and approximating small-set expansion. 


Random Walks and Mixing Time 

We consider random walks in a weighted undirected graph G = (V, E) with a nonnegative weight 
w{e) on each edge e £ E. Let n = \V\ and m = \E\. For simplicity, we assume that the graph G is 
regular and the weights are scaled such that the weighted degree of i is = 1 for all 

i £ V throughout this paper, but we will mention how to deal with general graphs in Section 2.5. 
Let A be the n x n random walk matrix of G with Aij = w{i,j). Let po £ K” be an initial 
probability distribution, and let pt := A^po be the probability distribution after t steps of random 
walks. When G is connected and non-bipartite, it is well-known that pt will converge to the uniform 
distribution. The mixing time is defined as 

Tmix = min{t : \\pt -||i ^ 1/4 for all initial distribution pol- 

n 

One approach to analyze the mixing time is to look at the eigenvalues of the random walk matrix. 
Let the eigenvalues of A be 1 = Ai ^ A 2 ^ ^ ^ — 1. By basic spectral graph theory, it can 

be shown that 1 > A 2 if and only if G is connected, and A^ > — 1 if and only if G is non-bipartite. 
This implies that, when G is connected and non-bipartite, pt will converge to the first eigenvector, 
and thus the uniform distribution is the unique limiting distribution of the random walk. Let 
A := max{A 2 , |A„|}, and let 1 — A be the spectral gap of the random walk matrix. A standard 
calculation shows that the mixing time is upper bounded by 0(log(n)/(l — A)). 

For many problems, it is useful to have combinatorial characterizations of graphs with fast mixing 
time. For two subsets S,T <£V, let E{S,T) be the set of edges with one vertex in S and another 
vertex in T, and let w{S,T) := Yl,e^E{ST) The expansion of a set S' C R and the expansion 

of a graph G are defined as 

^(S) := and .^(G):=^ min^^d(S), 

Cheeger’s inequality for graphs [AI 086 , AM85] states that 

i(l-A 2 )^</>(G)^ v' 2 ( 1 -A 2 ), 
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and thus 1 — A 2 = VL{(j){G)‘^). Having large conductance is not enough to guarantee fast mixing 
time, as \n may be very close to —1. There is a simple trick to bypass this issue: one can guarantee 
that ^ 0 by considering “lazy” random walks (with probability 1/2 stay put), and this implies 
that the mixing time of lazy random walks is upper bounded by 0(log(n)/(/(G)^). 

Another approach to analyze the mixing time is to directly use the graph structures. Lovasz 
and Simonovits [LS90] developed a combinatorial method to prove that the mixing time of lazy 
random walks is 0(log(n)/(/>(G)^). This method is more flexible in incorporating additional graph 
structures. Given a parameter 0 < <5 ^ 1/2, the J-small-set expansion is defined as 


(/^(G) := min (/)(5). 
S:|S|s:(5n 


Lovasz and Kannan [LK99] proved that the mixing time of lazy random walks is 

dx 


T < 

mix 


ll/n X(I)^{GY' 


As we will discuss in more details shortly, this combinatorial approach can also be used to design 
local graph partitioning algorithms for approximating small-set expansion. 


Evolving Sets 

The evolving set process is a Markov chain on subsets of V with the following transition rule: If 
the current set is S, choose U uniformly from [0,1] and the next set is dehned as 

S := {y: w{y,S) ^ U}. 

Morris and Peres [MP05] used the evolving set process to strengthen Lovasz and Kannan result 
to bound the uniform mixing time of lazy random walks by the expansion profile. An important 
definition in their analysis is the gauge of a set S and the gauge of a graph G, which are defined as 


V^(5) = 1-E[y^| 5|/|5|] and V’(G') := ^ mm^^^^(5). 

Morris and Peres [MP05] showed that the convergence rate of random walks is bounded by the 
gauge, and the mixing time of random walks is 0(log(n)/'!/(G)). They proved that ipiG) ^ 
Q{(j)‘^{G)) for lazy graphs, and this implies that the mixing time of lazy random walks is upper 
bounded by 0(log(n)/^(G)^). We refer the interested reader to [LPW08] for an excellent introduc¬ 
tion of the evolving set process. 


Local Graph Partitioning Algorithms 

Spielman and Teng [ST13] used random walks to design the first local graph partitioning algorithm, 
which outputs a set S of approximately optimal expansion with running time depends only on ISj 
and polylog(n). Their analysis is based on the approach of Lovasz and Simonovits [LS90] on 
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analyzing mixing time. Andersen and Peres [AP09] and Oveis Gharan and Trevisan [OT12] used 
the evolving set process for local graph partitioning, and provide the current best known algorithm 
in terms of both the approximation ratio and the running time. 

Theorem 1 ([AP09, OT12]). For any target set S* and any e > 0, there is a subset S' with 
IS"! ^ I S'* 1/2, such that if we start the evolving set process with {t>} for v G S', then with constant 
probability the algorithm returns a set S with (j){S) = 0{y(l){S*)/e) and |5| = 0(|5*|^''"'^), and the 
running time is 0 {\S*\^^'^^cj){S )~2 log^n). 


Small Set Expansion 

The small set expansion hypothesis proposed by Raghavendra and Steurer [RSIO] states that for 
any e, there exists 6 such that it is NP-hard to distinguish the following two cases: 

1 . There is a set S with <p{S) ^ e and |5| ^ 5n; 

2 . i/>(5) ^ 1 — e for every set S with [S’! ^ 6n. 

This hypothesis is shown to be closely related to the unique games conjecture [RSIO]. The local 
graph partitioning algorithms provide bicriteria approximation algorithms for computing small set 
expansion (ps{G)- It is observed in [OT12, KL12] that if the output size guarantee of the above local 
graph partitioning algorithm is improved from 0(|S'*|^'’'^) to 0(|S'*|), then the small-set expansion 
hypothesis is false. Oveis Gharan [Ovel3] suggested a plan to prove such an output size guarantee 
using the evolving set process. 


1.1 Our Results 

Combinatorial Analog of Spectral Gap 

We define a combinatorial analog of spectral gap with which we 
time of non-lazy random walks. Recall that 4>{G) is dehned as 

mm -—- = mm 1 — 

Scy,|SRn/2 |5| Scy,|5Kn/2 

We dehne the combinatorial gap as 


can directly analyze the mixing 
w{S, S) 


ip(G) := min 

SCV,TCV,\S\ = \T\^n/2 


w{S,T) 

| 5 | 


( 1 . 1 ) 


Note that (p{G) is small if there exists a near-bipartite component. We prove the following combi¬ 
natorial analog of the spectral analysis of mixing time. 

Theorem 2. For any graph G, ip{G) ^ fl((p^(G)). 
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By the aforementioned result of Morris and Peres [MP05] , one immediate corollary is that the mixing 
time of non-lazy random walks is upper bounded by 0(log(n)/(G)). An implication is that adding 
self-loops of weight (^{G) (instead of 1/2) is enough to guarantee mixing time 0{\og{n)/(tP‘{G)) in 
any graph, which may have applications in speeding up random sampling algorithms. 

Our proof of Theorem 2 is based on a new analysis of the approach by Lovasz and Simonovits [LS90] 
using the bar chart in Figure 2.2. We believe that the new analysis is more intuitive and provides 
better insights into what combinatorial properties are needed for fast mixing. 

Using Theorem 2 and the results in [KL14], another corollary is the following lower bound on 
small-set expansion of graph powers. 

Corollary 1. For any graph G and any integer t ^ 1, 

<^ 5 / 4 (G‘) = n(min{\/t • (j)siG), 1}). 

The same result is proved in [KL14] for lazy graphs, and here we prove it for all graphs. Note that 
it is not true that (j){G^) = fl(min{\/t • 4>{G), 1}) when G is bipartite, but the above corollary shows 
that it is true for small-set expansion even when G is bipartite. As shown in [RS14], this result can 
be used to amplify hardness results for the small-set expansion problem. 


Vertex Expansion 

The robust vertex expansion is defined by Kannan, Lovasz and Montenegro [KLM06] as follows: 
For S CV, let Ni/ 2 {S) := min{|T| | T C U — S' and w{S,T) ^ ^w{S,V — S)}. Define 

(/'^(S) := min I— \ 11 and (j)^(G) := min </'^(S) 

I |5| J S:\S\^n/2 

as the robust vertex expansion of a set S and the graph G. This definition slightly differ from the 
original definition in [KLM06] by bounding the vertex expansion above by one. We do this because 
it is the range of interest and the statement of our result would be much cleaner. Also define 

T(S) := (j)(S) ■ and T(G) := min T(S) 

S:|SKn/2 

as the minimum product of the edge expansion and the robust vertex expansion. It is proved 
in [KLL15] that 

1 - A2 = L!(T(G)), 

and that the spectral partitioning algorithm and the local graph partitioning algorithm using per¬ 
sonal pagerank vectors [ACL06] achieve better approximation when the robust vertex expansion is 
large. We prove a similar result for random walks and evolving sets. 

Theorem 3. For lazy graphs G, 'ii}{G) ^ D('I'(G)). 

A corollary is an improved analysis of Theorem 1 when the robust vertex expansion of G is large. 
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Corollary 2. For any target set S* and any e > 0, there is a subset S' with |S"| ^ \S*\/2, such 
that if we start the evolving set process with {?;} for v G S', then with constant probability the 
algorithm returns a set S with = 0{(j){S*)/e) and |5| = and the running time is 

0(|5*|i+2^(/>(5)-hog2n). 

Note that the conclusion ^(5) = 0{(l>{S*)/e) implies that (/>(5) = 0{(j){S*)/(p^(S)) = 0{(j){S*)/(f)^(G)). 
In particular, this implies that the evolving set algorithm is a constant factor approximation algo¬ 
rithm when c/)^(G) is a constant, for example when G is a planted random graph. This shows that 
the evolving set algorithm matches the improved analysis of the spectral partitioning algorithm 
in [KLL15]. We refer the reader to [KLL15] for more discussions and motivations for robust vertex 
expansion. 


Limitations 

The subexponential time algorithm for small-set expansion by Arora, Barak and Steurer [ABSIO] 
uses eigenspace enumeration and random walks. The short code example in [BGHMRS12] shows 
the limitation of the eigenspace enumeration method. It is a natural question to ask whether 
random walks can be used to disprove the small-set expansion hypothesis. There were very few 
results showing the limitations of these random walks based algorithms (see [ZLM13] for the only 
such result that we know of). One main difference between the truncated random walk algorithm 
by Spielman and Teng [ST13] and the evolving set algorithm by Andersen and Peres [AP09] and 
Oveis Gharan and Trevisan [OT12] is that the random walk algorithm is deterministic while the 
evolving set algorithm involves much randomness. Oveis Gharan [Ovel3] conjectured in his thesis 
(Gonjecture 12.3.4) that there is a small but nontrivial probability that all the sets explored by the 
evolving set process is of size 0(|5*|), and argued that this would disprove the small-set expansion 
hypothesis. We present an example for which the evolving set algorithm fails with probability one, 
refuting Oveis Gharan’s conjecture. 

Theorem 4. Given any e, there exists 5e such that for any 6 > 0, there is a graph G such that 
4>s{G) ^ but any subset of volume ^ returned by the evolving set algorithm in [AP09, OT12] 
has expansion at least 1 — e with probability one. 

The example is a /c-ary e-noisy hypercube, where the dimension cuts are of size n/k with expansion 
e. We show that, however, the evolving set algorithm will only explore the Hamming balls, and the 
expansion is at least 1 — e for all Hamming balls of size OinjF). We note that this example also 
shows that the random walk algorithm [ST13, KL12] and the pagerank algorithm [ACL06, ZLM13] 
fail to disprove the small-set expansion hypothesis; see Section 3. 

We believe that this example exposes the limitations of all known local graph partitioning algo¬ 
rithms, and can be used as a basis to prove further lower bounds (e.g. to show that the analysis 
of the OlysJf){S^ log(|5|))-approximation of the evolving set algorithm in Theorem 1 is tight when 
e = l/log(|5|)). 
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1.2 Relations with Previous Work 


Combinatorial Analog of Spectral Gap 

We note that the original analyses of Lovasz and Simonovits [LS90] and Andersen and Peres [AP09] 
heavily rely on the laziness assumption and cannot be used to work with the combinatorial gap. 
The bar chart in Figure 2.2 is the new element introduced to analyze the combinatorial gap as well 
as the robust vertex expansion. 

Trevisan [Trel2] defined the bipartiteness ratio j3{G) of a graph and proved that + 1 is related 
to /3(G) as if 1 — A 2 is related to (/)(G) stated by Cheeger’s inequality. After formulated and 
proved Theorem 2, we observe that Trevisan’s result combined with the spectral argument can 
also be used to derive the corollary that the mixing time of non-lazy random walks is bounded by 
0(log(n)/y?(G)^). However, we remark that Theorem 2 and Corollary 1 cannot be derived from 
Trevisan’s result and the spectral approach, and also that the formulation of Theorem 2 is new. 

Bilu and Linial [BL06] defined a combinatorial property called “jumbleness”, proved that it is a 
log((i)-approximation to the spectral gap where d is the maximum degree of the graph, and used it 
to establish a converse to the expander mixing lemma. The definition of the jumbleness is similar 
to our definition of the combinatorial gap in that it also concerns about w{S,T) for two subsets of 
vertices S,T C V, but the precise definition and the theorem obtained are incomparable to what 
we have in this paper. 


Mixing Time and Local Graph Partitioning 

The results in Lovasz and Kannan [LK99] and Kannan, Lovasz and Montenegro [KLM06] show that 
the mixing time of lazy random walks is 0(log(n)/'L(G)), among other conditions that imply faster 
mixing. However, their results cannot be applied to analyze local graph partitioning algorithms as 
in Corollary 2. 

Besides the random walk algorithm [ST13, KL12] and the evolving set algorithm [AP09, OT12], 
there is also a local graph partitioning algorithm using pagerank vectors [ACL06, ZLM13]. In terms 
of the approximation guarantee, the output size, and the running time, the pagerank algorithm is 
subsumed by the evolving set algorithm in [AP09, OT12]. 

In [KLL15], it was shown that the pagerank algorithm performs better when the robust vertex 
expansion is large. Similar results were not known for random walks and evolving sets, as the 
spectral techniques in [KLL15] are not applicable. These results are proved in this paper by a 
new analysis of the combinatorial approach of Lovasz and Simonovits [LS90]. Finally, we remark 
that this paper is a subsequent work of [KLL15], and both the results and the techniques are 
different from [KLL15], especially the combinatorial analog of spectral gap, the counterexample 
for the evolving set algorithm, and the new analysis of Lovasz and Simonovits approach using the 
barchart. 
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2 Faster Convergence 


In this section, we prove the positive results about faster convergence rates of random walks and 
evolving sets. Our proofs are based on the combinatorial method of Lovasz and Simonovits [LS90], 
and we will begin with an introduction of their techniques in Section 2.1, and then we will discuss 
the proof outline and highlight the new idea in Section 2.2. Then, we will prove Theorem 2 about 
combinatorial analog of spectral gap in Section 2.3 and then prove Corollary 1 about small-set 
expansion of graph powers. Then, we will prove Theorem 3 about robust vertex expansion in 
Section 2.4 and then show its application in local graph partitioning. Finally, we will discuss how 
to extend the results to non-regular graphs in Section 2.5. 


2.1 Lovasz-Simonovits Curve 


For any vector p G M”, Lovasz and Simonovits [LS90] study the curve C{p) : [0, n] —>■ M that plots 
the cumulative sum of p defined as 


C{p,x) 


max > c(i)-p(i). 

ce[0,l]-EiC(i)=a;^^ 


( 2 . 1 ) 


In words, C{p,x) is just the sum of the first x largest elements in p when x is a positive integer, 
and the curve C{p, x) is defined for all x G [0, n] by piecewise linear extension. It is clear from 
the definition that C{p) is a concave function. We are interested in studying the curve C{A^p, x) 
where ^ is a random walk matrix and p is a probability distribution. Notice that as A^p converges 
to the uniform distribution as t becomes larger, C{A^p) converges to the line x/n and vice versa. 
In [LS90], their method to bound the mixing time is to bound the difference between C{A^p) and 
the line x/n. When A is the lazy random walk matrix, they proved that 

C{A% x) ^ - + v^(l - (2.2) 

n 8 

and this implies that the mixing time of lazy random walks is 0{log n/(j){G)‘^). The key lemma in 
their proof is the following inequality: For any lazy random walk matrix A, any p G M” and any 
integral x, 

C(Ap, x) ^ (C(p, x(l - (/>(G))) + G(p, x(l -)- ^(G)))). (2.3) 

The bound in (2.2) follows from an inductive argument using (2.3); see [LS90, ST13, KL12] and 
also a slightly more general version in Lemma 7 in Section 2.4. We remark that their proof of (2.3) 
crucially relies on the assumption that there is a self-loop of weight 1 /2 on each vertex and is a bit 
magical. 


2.2 Proof Outline 

We mainly outline the proof of Theorem 2 in this subsection, but we will briefly mention the 
modifications to prove Theorem 3 at the end. We will prove the following inequality similar to 
(2.3) using the combinatorial gap (without the laziness assumption that An ^ 1/2 for all i G V). 
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Figure 2.1: An illustration of the proof ideas. 


Lemma 1. For any random walk matrix A, any p G M”' and any integral x ^ n/2, 

C{Ap, x) {C{p, x(l - (p{G))) + C{p, x(l + (p{G)))). 

With Lemma 1 in place of (2.3), the same inductive argument that we mentioned before implies 
the convergence result in (2.2) with 4>{G) replaced by (p{G). It turns out that the analysis of the 
Lovasz-Simonovits curve can be used to analyze the evolving set process, and the arguments in 
Lemma 1 can be adapted to prove Theorem 2. To prove Lemma 1, we consider an arbitrary S' C 1/ 
and try to bound the total probability in S after one step of random walk {Ap){S) := {Ap)i. 
To bound {Ap){S), we look at where the probability in S is coming from. For each i £V, let 

ds{i) := w{i,S) 

be the total weight coming from i to S. Recall that we assume the weighted degree of each vertex 
is one. So, we have ds{i) G [0,1] for any i and 

E ■*«<') = 1^1 ■ P-'i) 

i&V 

The reason of this definition is that (Ap)(S) = ' p{i)- We sort the vertices so that 

^ 5 ( 1 ) ^ ds{2) ^ ... ^ ds{n). Let T := {1,2,... ,|S|} be the |S| vertices with largest ds values; 
note that T is in general not equal to S. See Figure 2.2 for an illustration of the proof setup. 
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The obstruction for mixing is when ds{l) = ds{2) = ... = d^dSI) = 1 and p{l), ■ ■ ■ ,p(|5'|) are the 
highest probability in p, in which case we would have C{Ap, jS'l) ^ (74p)(S') = XlieS p{t) = |5|) 

and thus the curve is not dropping after one step of random walk. This could happen when S = T 
in which case S' is a disconnected component of G (corresponding to A 2 = 1), or when S and T 
form a bipartite component (corresponding to A„ = —1), and in these cases the random walk may 
not mix (depending on the initial distribution). 

The combinatorial gap in (1.1) is defined precisely to exclude the obstruction. It states that any 
subset T with |T| = IS"! can only contribute w{S,T) ^ (1 — <^(G))|5| to w{S,V) = [S'!, so as to 
guarantee that the curve would drop, i.e. C{Ap, |5|) = max 5 /,| 5 /|=| 5 | Ap{S') < C{p, |S|). To prove 
Lemma 1, we look at the bar chart in Figure 2.2 horizontally and consider the telescoping sum 

n n in 

{Ap)(S) = ds{i) -Pii) = ^{ds{i) - ds{i + 1)) ^p(i) < ~ 

i=l i=l j=l i=l 

Suppose we put a threshold 1/2 and consider the upper area x = X]r=i max{d 5 (i) — ^,0} and the 
lower area y = aiin{(i 5 (i), ^}; see Figure 2.2. By concavity of the curve C, we will prove in 
Lemma 2 that 

{Ap){S) ^ i(C(p,2x) + C(p,2y)) = i(C(p, 2x) + C(p, 2|5| - 2x)). 

The definition of combinatorial gap in (1.1) forces ds to spread out, and we will prove in Lemma 3 
that it implies that 

x<i(l-^(G))|5|. 

Combining these two steps gives Lemma 1. Once we prove Lemma 1, the same calculations can be 
used to prove Theorem 2 about the gauge in the evolving set process. 

The proof of Theorem 3 is also based on the idea of using the bar chart in Figure 2.2. First, we 
translate the definition of robust vertex expansion to an upper bound of the area of the largest 
vertices. Then, we will choose a different threshold, and then we will modify the inductive argument 
accordingly to establish a similar bound as in (2.2). The proof will work for the gauge in the evolving 
set process to prove Theorem 3. 

2.3 Combinatorial Analog of Spectral Gap 

As described in the outline, we will prove the following two lemmas. 

Lemma 2. For any subset S CV with |5| ^ n/2, let x = XlieV 

iAp)iS) ^ ^(C(p,2x) + C(p,2(|5| -x))), 

for any random walk matrix A and any vector p G MA. 

Lemma 3. For any subset S CV with IS"! ^ n/2, let x = max{(i 5 (f) 

a; ^ ^(1 - V?(G'))|S'|. 


— 0}, then 


— i,0}, then 
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First, we assume the lemmas are correct and derive Lemma 1. 


Proof of Lemma 1. Let 5* be a subset of V. By Lemma 2, Lemma 3 and the concavity of C, we 
have 

(Ap)(S) 4 ^(C(p, 2x) + C(p, 2(|5| - x))) ^ ^(C(p, (1 - ^(G))|5|) + C(p, (1 + p{G))\S\)), 

where the last inequality holds since for any concave function / and any values a ^ b ^ c ^ d with 
a + d = 6 + c, we have /(a) + f{d) ^ f{b) + /(c). Since the argument applies to all subset S C V 
with l^l ^ n/2, we have 

C{Ap, |5|) = max (^p)(5') ^ ^ {C{p, (1 - ¥p(G))| 5|) + C(p, (1 + ¥.(G))|5|)). 

|o'| — |»b| ^ 


□ 


Next we prove Lemma 2, which follows from the concavity of the curve C. 

Proof of Lemma 2. Recall that {Ap){S) = YhiPi^)' ds{i)- For convenience, we define the boundary 
values to be ds'(O) = 1 and ds{n + 1) = 0, and also define p(0) = 0. Then, there is an index k such 
that ds{k) > 1/2 ^ ds{k + 1). By looking at the bar chart in Figure 2.2 horizontally as in the 
outline and considering the telescoping sum, we have 

n n in 

{Ap){S) = ^ds(z) •p(i) = ^{ds{i) - ds{i + l))C^p{j)) ^ - ds(i + 1)) • C(j),i) 

2=0 2=0 j = l 2 = 0 

= (^^{ds{i) - ds{i + 1)) • C{p, i) + {ds{k) - i) • C{p, k)^ 

+ ( (^ - dsik + 1)) ■ C{p,k) + (dsii) - dsii + 1)) • C{p,i) \ . 

\ ^ i=k+i / 

Recall that C{p) is concave, and for any concave function /, we have by Jensen’s inequality 

2 2 2 ^ 

Since 

k—l 

^{ds(i) - ds(i + 1)) + {ds(k) - i) = ds{0) -1 = 1 

i=0 

and 

{\ - ds{k + 1)) + ^ (dsii) - dsii + 1)) = ^ - dsin + 1) = ^, 
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by applying Jensen’s inequality with f = C{p), we have 


{Ap){S) ^ ^p,2(^'^J^ds{i)-dsii + l))-i + {ds{k)-^)-kJj 

+ - ds{k + l))-k+ ^ {ds{i) - ds{i + 1)) • z) V 

V i=k+l ) 


Finally, note that the first sum 

k—1 ^ k ^ n - 

'^{ds{i) - ds{i + 1)) • f + {ds{k) - -) ■ k = '^{ds{i) - 2 ) = Z]max(ds(z) - 2 >0) = 

2 = 0 i = l 2=1 

Since Y17=oidsii) — ds{i + 1)) • z = ^^1=1 dsi^) = |<S'| by (2.4), the second sum is |5| — x, and so we 
have 

{Ap){S) ^ ^C{p, 2x) + Cip, 2(|5| - x)). 

□ 


Now we prove Lemma 3, which uses the definition of the combinatorial gap in (1.1). 


Proof of Lemma 3. Recall that we sort the vertices such that 1 ^ ^ 5 ( 1 ) ^ ^ ds{n) ^ 0. Let k 

be the index such that ds{k) > 1/2 ^ ds{k + 1). Let T = {!,..., |5|} be the subset of the first |S| 
vertices. We consider two cases. The first case is when k ^ |5|, in which 


|S| 


X = 


^max{d5(z) - ^,0} = '^{ds{i) - \)< ^ ^ X] ^ :^(1 “‘^(<^))l'^l> 

2=1 


2=1 


2=1 


2=1 


where the first inequality holds as ds{i) G [ 0 , 1 ], and the last inequality holds by using ( 1 . 1 ) to 
obtain (p{G) ^ 1 -'u;(5, r)/|S’| = 1 - E 12 W 5 (*)/| 5 '|. 

The second case is when k > |5|. Note that dsdS'j) ^ 1 — p{G), as otherwise T would violate (1.1). 
Then, for any z ^ |5|, we have ds{i) ^ (is(|S'|) ^ 1 — p{G). Therefore, for any z ^ \S\, 

j 1 / .7 z z i \ / ds{i) • (1 - (p{G)) 

ds(.) - 2 < ^-• 

and we have 
k 


X = 


"■ 1 1 ^ 1 1 1 — ir*! ^ 

- ^) + E (-isw - < Ef-'sw - 2 ) + (—1^) E ‘'s(i) 


2 = 1 


2=1 


i=|S|+l 


i=l 


i=|S|+l 


= {w(S,T) - ^) + (1—h^J)(|S| - w{S,T)) = (i±|M).„(5,r) - M + (L_|(ii))|s| 

< (Lt|<S)(i - ,,(G))|S| - J|1 + (Li|<S)|s| < (Li|M)|s|, 
where the second last inequality is by (1.1). Hence, in any case, we have x ^ i(l — <y9(G))|5'|. 


□ 
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Evolving Sets 


Finally, we prove Theorem 2 about the gauge of the evolving set process, which involves very similar 
calculations as in the proof of Lemma 1. Recall that the gauge is defined as ipiS) = 1 — 

The following claim follows from Morris and Peres [MP05]. 

Claim 1 ([MP05], Equation 27). For any t G [0,1], we have 

t ■ E[|5| I ^ t] = min{t, ds{i)}- 
i£V 


Proof. Note that P[i G S | 17 ^ t] = min{l, ^^(z)/!}. It follows that 


E[|5| I 1/ ^ t] = G 5 I 1/ ^ t] = min{l, ds{i)/t}. 

i£V i£V 


□ 


When t = 1/2, Claim 1 implies that 

iE[|5| I ^ 

i£V 

which is the lower area under the threshold 1/2 (see Figure 2.2). And 

^E[\S\ \ U^^] =E[5]-iE[|5| I ^ 

iev iev iev 

which is the upper area above the threshold 1/2, and this is at most ^(1 — v^(G))|S'| by Lemma 3. 
We are ready to prove Theorem 2. 


Proof of Theorem 2. By definition, 


f,{S) = 1 - E[y/|5|/|5|] = 1 - ^E[y^|5|/|5| \U^l]- ^E[y^|5|/|5| \ U ^ 

Let X = Yli ra8ix{ds{i) — 0} be the upper area. We know that x ^ |(1 — <y9(G))|5'| by Lemma 3. 

By concavity. 


Similarly, 


F 1 


1, 1 2x 


-E[V|5|/15| I 1/ ^ ^ \ u^-] = - J — 


| 5 | 


|5| 


1 , 1 


1 2(|5|-x) 


-E|\/|S|/|S||[/<-|<-Jj^E||S||[/<-| = - 1^1 


12 












Plugging these two inequalities back into the first one, we have 


^ 1 - -(^2 — j^+ ^ 1 - -(-v/l + ip{G) + a/I - <f{G)) ^ </j(G)^/8, 

where the second inequality is because \/2 — 2x/\S\ + \j2x/\S\ ^ i/2 — 2y/\S\ + Y^2y/|5| for 
X ^ y ^ \S\/2 and we put in ?/ = ^(1 — <^(G))|S'|, and the last inequality is by Taylor expansion of 
the function + ip. □ 


Small-Set Expansion of Graph Powers 


In this subsection, we prove Corollary 1 about small-set expansion of graph powers. Recall that 
the 5-small-set expansion is dehned as 


UG) 


min 1 — 

SCV,\S\^5n 


W{S, S) 


We dehne the 5-small-set combinatorial gap as 


MG) 


min 1 

scy,Tcy,|s|=|rR5n 


w{S,T) 

| 5 | 


The following is a simple relation between these two quantities. 
Lemma 4. For any 5^1/2, we have ips/ 2 {G) ^ (ps{G)/2. 


Proof. Suppose S,T are two subsets of size at most 5n/2 that achieve w{S,T) = (1 — (/J 5 / 2 (G))I‘S'| 
and |5| = |T|. We argue that 5 U T has small expansion, since 

u;(5uT,5ur) ^ 2w{S,T)-w{SnT,SnT) ^ 2{l - ip^/ 2 {G))\S\ - |5nr| 

= 2|5| -\Sr^T\- 2\S\ip5,2{G) = IS U T| - 2\S\<fs/2{G) ^ |SU r|(l - 2ips/2(G)). 

Since |S U Tj ^ 2|S| ^ 6n, this implies that (j)s{G) ^ 1 — w{S U T, S U T)/|S U T| ^ 2ip^/2iG). □ 


In [KL14], using the inequality (2.3), it is proved that for any lazy graph, 

4>s/2{G^) > ^{uim{Vi ■ MG), !))• 

The assumption of laziness is only used for the inequality (2.3) to hold. Now, with Lemma 1, we 
can replace (f 5 {G) by y^siG) and prove that for any graph G, 

4>5/2{G^) ^ II(min(\/f • ipsiG), 1)). 

Combining with Lemma 4, it follows that for any graph G, 

h/4:{G^) > II(min(\/t • (l)s{G), 1)). 
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2.4 Vertex Expansion 


We will prove Theorem 3 in this subsection. As in Section 2.3, we will first prove faster convergence 
for random walks and then for evolving sets. Intuitively, larger vertex expansion will lead to faster 
mixing, as the probability coming into a set S is from many different vertices, and so it cannot 
be the case that all probability in S come from a small number of vertices with high probability. 
Our proof idea is also to look at the bar chart in Figure 2.2, and translate the definition of robust 
vertex expansion into an upper bound of the area of the largest vertices. 

For S' C f/, we consider the vector ds € M” with the i-th. entry being ds{i), and as before we 
assume that ^^(l) ^ ds{2) ^ ^ ds{n). We use the notation C{ds,x) in (2.1) on ds so that we 

can talk about the largest x values in the vector ds (note that x could be non-integral). 

In Theorem 3, unlike in Theorem 2, we need the additional assumption that the random walk 
matrix is lazy, such that w{i, i) ^ 1/2 for any z G E. The main reason of this assumption is to have 
ds{i) ^ 1/2 for z G S and ds{j) ^1/2 for j ^ S, so that we can assume that 

S = {1,2,...,|S|}, 


i.e. the vertices in S are the vertices with the largest values in ds. Recall that we defined 
Vi/ 2 (S) :=min{|r| | T C R - S and w{S,T) ^ ^w{S,V - S)}. 

Since S = {!,...,|S|}, we have for any integral x, 


max w(S,T) = max > ds(i) 
T:TCV-S,\T\=x T:T<ZV-S,\T\=x^ 

- >1 I _ >1 I 


\S\+x 

ds{t) = C{ds,\S\+x)-C{ds,\S\). 


i=|5|+l 


In words, the set of vertices T CV — S that maximize w{S, T) are the vertices US’! -|-1,... , IS"! -|-|T|} 
in the ordering defined by ds. So, we can rewrite the definition of Nii 2 {S) as 

Vi/ 2(5) := min{x | C{ds, |5| + x) - C{ds, |5|) ^ iu;(5, E - 5)}. 

Note that we allow x to be non-integral. This differs by at most one compared with the original 
definition in [KLM06], and will make our proofs much cleaner. The robust vertex expansion is 
defined as (j)^{S) := min{V;^/2('S')/|5'|, 1} and (/^{G) := min 5 .| 5 |^|y |/2 0^(5') as before. Similarly, 
4'(S') is defined as before using the new definition of Ni/ 2 {S). The following lemma translates the 
definition of {S) to a bound on the cumulative sum of the largest vertices in the bar chart. 


Lemma 5. For any S CV with [S'! ^ |E|/2, we have 


C{ds,il + cP^iS))\S\)^{l 


2 


)l^l- 


Proof. Since C{ds, l^l +x) is continuous with respect to x, the minimum in the definition of Ni/2{S) 
is attained when C{ds, IS"! -t- x) — C{ds, IS*!) = ^wiS, E — 5). As x = Ni/2iS) ^ (/''^(S')|5'|, we have 

C{ds, (1 + cf^iS))\S\) - C{ds,\S\)^ ^w{S, V - S). 
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Finally, since 5 = {1,..., |5|}, we have C{ds, l^l) = w{S, S) = {1 — (/>(5))|S'| and w{S, V — S) = 
(l){S) ■ |5|, we have 

□ 


Using Lemma 5, we will prove a bound similar to that of Lemma 1. The two steps (Lemma 2 and 
Lemma 3) of proving Lemma 1 are integrated and steamlined in the proof of the following lemma. 

Lemma 6. Assume (7(^5,0151) ^ b\S\ for any |S| ^ re/2 for some a > 1 and b < 1, then for any 
p G M”, we have for any S with IS"! ^ re/2, 

C{Ap, |S|) « (/^) . 0{p, 6|S|) + (^) . C(p, ^). 


Proof. Using the same concavity argument as in Lemma 2, for any threshold t G (0,1) and k such 
that ds{k) > t ^ ds{k + 1), we have for any S with l^l ^ re/2, 

/ k-i 

{Ap){S) ^{l-t)-C ip, ( '^{dsii) - dsii + 1)) • z + {ds{k) - t) ■ k'^ 

\ i=0 

n 

p, - (^{t - ds{k + 1)) • A: + ^ {ds{i) - ds{i + 1)) • z) 

i=k-\-l 



Let X = YM=Q{ds{i) — ds{i + 1)) • z + {ds{k) — t)-khe the upper area above the threshold t. Then, 
it follows that 


C{p, |5|) = max(Ap)(S’) ^ (1 - t) • C{p, + t • C{p, ). 


It remains to prove an analog of Lemma 3 to bound the upper area x. We again consider two cases. 
The first case is when k ^ ajS'l, in which 
k k 

X = '^{ds{i) -t) ^ ^(1 - t) ■ ds{i) = {1-t) ■ C{ds, a|5|) < (1 - t) • b\S\, 

i=\ i=l 


where the last inequality uses the assumption that C((i 5 ,a|S'|) ^ b\S\. The second case is when 
k > a|5|. Note that for any z > a\S\, ds{i) ^ d 5 (a|S'|) ^ b/a, as otherwise the assumption 
C{ds,a\S\) ^ 6|S'| would be violated. Hence, for any i > a|S'|, 


ds{i) - t ^ • ds{i) = (1 - y) • ds{i), 


and we get 


X = '^{dsii)-t) ^ C{ds,a\S\) - a\S\t+ ^ (1 - y) • d 5 (z) 


i=l 


i=a|5|+l 


= C{ds,a\S\) - a|5|t + {\S\ - C{ds,a\S\)){l - j) ^ |5|(1 - j), 
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where the last inequality uses the assumption that C{ds, a|*S'|) ^ b\S\ and some simple calculations. 
We balance the two upper bounds ^l/SKl — t) and |5'|(1 — at/b) by choosing t = {b — b’^)/{a — 6^), 
so that in both cases we have 

b{a — b) 


X < 


-| 5 |. 


(2.5) 


a — 

Putting the choice of t and the bound on x back (and using concavity), we obtain the conclusion 
of the lemma. □ 


Lemma 6 is a generalization of (2.3), and we can use it to derive a generalization of (2.2). Note 
that any probability distribution p satisfies the condition C{p,x) ^ ^ + y^min{x, n — x} in the 
following lemma. 

Lemma 7. Assuming (7(^5,0151) ^ 6|5| for any |5| ^ n/2 for some a > 1 and 6 < 1, then for any 
p € M”' satisfying C{p,x) ^ + csj min{x, n — x} for all x E [0, n] for some c, we have 

C{A^p, x)^- + • (1 - 

n Ja + b 


Proof. We only consider the case that x ^ n/2 in the following; the case x > n/2 can be handled 
in the same manner. When x ^ n/2, we have C{p,x) ^ ^ + c^/x. By Lemma 6, 

C{Ap, x) ^ {-—^) • C(p, bx) + {-—^) • C{p, ^) 
a — 0 ^ a — 0 ^ b 


/ Qj — 6 \ ,X n —^ / b — 6^ \ . X j ILJb , 

^ ^ - + + - ).{- + C\ ) 

a — b^ n a — b^ n ^ 


X a — b 

= -h CV X 

n 


((a — 6) + (1 — b)\/a)yfb\ 


X 

= —V c^x 
n 


a — b'^ 

1 


a — b‘^ 

= — + c^/x ■ (- ~ + Va)Vb) 

a — b^ 


n 


X ^ .(1 + v^)a/6x 

—-^ ® -T-) ■ 

n ^ Ja + b ' 


Now note that. 


1 - 


{\pyfa)\lb yfa P b — \lb — \fah (y^ — \/6)(l — v^) 


a + b 


0 + 6 


y/a + 6 

The above argument shows that C'(Ap, x) ^ ^ + c-yx(l — (y/o — \/6)(l — \/6)/(y/a + 6)). Apply the 
same argument inductively (with different c in each iteration) gives the lemma. □ 


In particular, for any probability distribution p, we have the following generalization of (2.2): 

{y/d-Vb){l-Vb) ^ 


C{A*P,x) ^-h y/x ■ (1 - 

n 


0 + 6 
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Evolving Sets 

We prove Theorem 3 about the gauge in the evolving set process. 


Proof of Theorem 3. By Claim 1, for any t E [0,1], the lower area below threshold t is 

t ■ E[|,S| I ?7 ^ t] = min{t, = |S| — max{(is'(i) — t, 0}, 

i&v iev 

and the upper area above threshold t is 

(1 - t) • E[|5| \U^t]= E[5] - t • E[\S\ \U !^t] = Y^ max{d 5 (i) - t, 0}. 

i£V 

By the same argument in Lemma 6 , using the assumption C{ds,a\S\) ^ 6|5| and setting t 
{b — h^)l(a — 6 ^), the upper area above threshold t is 

X := ^ma.x{dsii) -t,0} ^ 

a — b^ 


as stated in (2.5). Therefore, 


V>(5) = 1-E[V|5|/|5|] 


= l-f-E[V|5|/|S| I [/^t]-(l-t)-E[V|5|/|5| \ U^t] 


^ 1 - tVlE[|5|/|5| I 17 ^ t] - (1 - t)VE[|S|/|S| \ U^t] 

= 1-ti 


1 |5| -X 


| 5 | 


t 


-(l-i)i 


1 


X 






1 


b{a — b) 


(1 — t)\S\ a — b"^ 


|5|, 


where the first inequality is by the concavity of the square root function, and the second inequality 
by the following fact: Suppose / is a concave function and ci ^ C 2 ^ C 3 ^ C 4 satisfy tci + (1 — t)ci 
tc 2 + (1 — t)c 3 for some t E [0,1], then tf{ci) + (1 — t)f{c 4 ) ^ tf ( 02 ) + (1 — t)/(c 3 ). Note that 


^(151 _ _ bja-b) . 

t|5r ' a-62 6-62^ a-62 ' 


a — b'^ — b[a — b) a 
6-62 = 1 ’ 


and 


Hence, 


~ b) _ 0 , — b'^ b{a — 6) _ ^ 

■|h| = ^ 77 ;- = 6. 


a — b a — b'^ 


(1 — t)\S\ a — b'^ 
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where the last inequality follows from the calculations in Lemma 7 (starting from the third line in 
the first block of calculations). We put a = 1 + (p^{S) and 6 = 1 — Note that since (S) ^ 1 
by dehnition, we have > 1 + cj)^{S)/3. This is the only place we need to assume 0^(5) ^ 1. 
On the other hand Vb ^ 1 — (f){S)/2. Therefore we have 

^ 3 ^18 18 ■ 

□ 


Local Graph Partitioning 

We obtain Corollary 2 about the performance of the evolving set algorithm in [APOD, OT12]. 
In Lemma 5.2 of [OT12], Oveis Gharan and Trevisan actually showed that the (volume biased) 
evolving set process will return a set S with ?/>(5) = 0{(p{S*)/e), and they used the fact that 
ipiS) = to get Theorem 1. Now, with Theorem 3, we can replace by 'I'(5') and 

obtain Corollary 2. 


2.5 General Graphs 


Our results generalize to non-regular undirected graphs, with appropriate changes in various defi¬ 
nitions. 

Expansion: For a general undirected graph G, we use vol(5) := Ylies deg(i) to denote the volume 
of a subset S. It is the non-regular analog of the size |S'|. The conductance of a set S' C 1/ and the 
conductance of the graph are defined as 


HS) := 


w{S,V -S) 
vol(S) 


and 


</>(G) := 


min </>(S'). 

5:vo1(S)!£vo1{V')/2 


Ideally, the analog of the combinatorial gap would be 


^{G) := 


min 1 — 

S,T:vol(S)=vol(r)sgvol(\/)/2 


w{S,T) 

vol(5) 


However, this dehnition may not say much since it can happen that any two different subsets have 
different volume. In order to handle this situation, we revise the dehnition and allow S and T to 
be fractional. Let d be the degree vector of G. We dehne the combintarial gap as 


^{G) := 


min ^ ^ 1 — 

XsG[0,l]'^,XTG[0,l]'^:(Xs,'h = (XT,d>sSvol(F)/2 


{xs,Axt) 

(xs,d) 


Lovasz Simonovits curve: In general graphs, the Lovasz Simonovits curve G{p) : vol(P) —?• R is 
dehned as 

G{p,x):= max __ {c,p). 

cG.[0,l]'^:{c,d)=x 
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Suppose the vertices are sorted so that p(l) > p(2) > ■ ■ ■ > p{n). The extreme points of the curve 
C{p) is Ej=i deg(i) for j = 0,...n. 

Bar chart: We sort the vertices so that ds{i)/ deg(i) is decreasing. We should view the bar chart 
so that each bar has width deg(i) and height ds{i)/ deg(i). So the total width is vol(y). Same 
as before, we put a threshold 1/2 (or choosing another threshold t E [0,1] in the proof for vertex 
expansion) and consider the upper area x and lower area y, and show that 

{Ap){S) ^ ^{C{p,2x) +C{p,2y)). 

With this figure in mind, the proofs for the extended results are essentially the same as the original 
proofs. 

Vertex expansion: The robust vertex expansion is defined as follows. Let q{i) = ds{i)/ deg(z), 
and 

Ni/ 2 {S) ■■= min{x | C{q,Yo\{S) + x) - C'(q', vol(5)) > 

Then (j)^{S) := min{iVi/2(5)/vol(5), 1}, and := mms■,^ol(S)^^;oliV )/2 (S). 

Restating the Theorems for General Graphs 

In the following, we restate our results on general graphs without proofs. Lemma 1 becomes 
Lemma 8. For any extreme point x ^ vol(V)/2, 

C{Ap,x) ^ ^(C'(p,x(l - (p{G))) +C{p,x{l + ip{G)))). 


Lemma 2 becomes 

Lemma 9. For any subset S CV with vol(5) ^ vol(V)/2, let 

a: = gdega).max{^-io), 

then 

{Ap){S) ^ ^{C{p,2x),G{p,2{vol{V) - x))), 
for any random walk matrix A and any vector p E M"’. 

Lemma 3 becomes 

Lemma 10. For any subset S CV withvol{S) ^ vol(V)/2, letx = deg(i) mBx.{ds{i)/ deg(i) — 
1/2,0}, then 

a; ^ ^(1 - ‘^(G'))vo1(5). 

Lemma 6 becomes 
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Lemma 11. Let q{i) := ds{i)/ deg(i). Assume C{q, avol{S)) ^ bvol{S) for any S with vol(S') ^ 
vol(y)/2 for some a > 1 and b < 1, then for any p G M”, we have 

C{p,vol{S)) ^ i) 
a — b^ 

With these definitions and lemmas in 
introduction. 


— avol(5), 

• C j>,6vol 5 + -^ • CCp,-^ . 

a — b^ b 

place, Theorem 2 and Theorem 3 hold as stated in the 


3 Limitations 

We prove Theorem 4 that provides a hard small-set expansion instance for the evolving set process 
studied in [AP09, OT12]. As mentioned in the introduction, it will be a noisy hypercube H over 
alphabet size k = 1/5. Formally, H is a graph on k'^ vertices, representing all strings of length d 
over alphabet [k]. For two vertices x,y, the edge weight w{x,y) is set to be the probability to go 
from vertex x to vertex y in one step of a random walk, where each symbol of x is independently 
rerandomized with probability e: For each i G [d], with probability 1 — e, set yi = Xi, otherwise yi 
is sampled uniformly at random from [k]. Note that H is 1-regular. It is easy to see that H has a 
small sparse cut. 

Claim 2. There is a set S with expansion at most e and IS"! = 6n where 5 = 1/k. 

Proof. Indeed, the coordinate cut S = {x £ [k^ | xi = 0} has size 5n and expansion at most e. □ 

We will show that all the sets explored by the evolving set process have expansion close to one. 
First, we argue that the evolving set process will only explore the Hamming balls of the noisy 
hypercube in Lemma 12. Then, we will show that the expansion of all Hamming balls of size 0{5n) 
is close to one in Lemma 13. 

The evolving set process starts from a singleton set on H. By symmetry, we may assume this set is 
{0'^}. We now show that the evolving set process only explores sets that are Hamming balls B{r) 
(around 0“^), where B{r) denotes all strings of Hamming weight at most r: 

B{r) := {x G [A:]'^ | |x| < r} where \x\ := |{z G [d] | Xi / 0}|. 

Indeed, the initial set {0'^} is the Hamming ball B{0). The following lemma implies that, if the 
current set is a Hamming ball B{r), then so is the next set, and thus by induction the evolving set 
process will only explore Hamming balls. 

Lemma 12. Suppose e ^ 1/2. For any r ^ 0, any x,y £ [k]'^, if |x| ^ \y\, then 

w{x,B{r)) ^ w{y,B{r)). 

(It follows that if S is a Hamming ball, then y £ S implies that x £ S, and thus S is also a 
Hamming ball.) 
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Proof. Note that w{x,z) depends only on the Hamming distance |x—(coordinate-wise subtraction 
modulo k). We first show via a symmetry argument that 

w{x,B{r)) = w{y, B{r)) whenever |x| = \y\. (3.1) 

To this end, we will construct a permutation vr on [k^ that (i) preserves Hamming distances: 
| 7 r(a)— 7 r( 5 )| = \a—b\ for all a,b ^ [k]'^, (ii) tt{x) = y, and (iii) 7 r( 0 ‘^) = O®*. Assuming this permutation 
exists, we get that z G B{r) if and only if 7t{z) G B(r), since \'x{z) — 0| = \'x{z) — vr(0)| = \z — 0|. 
Also, \x — z\ = |vr(a;) — t^{z)\ = \y — thus 

w{x,z) = w{y,TT{z)). 

Summing this equality over all z G B{r), we get (3.1). 

We now construct such a permutation vr. Take any bijection a on [d] that maps I := {z | Xj 7 ^ 0} 
onto {i \ yi ^ 0}. For z G /, let Tj be the permutation on [k] that simply swaps Xi and ya{i)- Then 
we define 7r(a) = b where 6* = a* if z ^ I, and = Ti{a„(^i-)) otherwise. It is easy to verify that tt 
has all the required properties. 

We now deal with the general case \x\ < \y\. It suffices to prove the lemma assuming \y\ = \x\ + 1. 
By (3.1), we may assume that x is the indicator vector on a subset S = [c] for some c, and y is the 
indicator vector on [c -|- 1], so that y differs from x only at position c. Picking a random neighbor 
Z of X is equivalent to picking W = x — z and setting Z = x + W, so the lemma is equivalent to 
-|- IF G B(r)] ^ Fw[y -|- IF G B{r)]. In fact, we will show this inequality conditioned on all 
values of IF* except i = c. Let W-c G be a fixing of all those values. We will show 

Pwc[a; -b IF G B{r) \ W-c = W-d > ^wAv + IF G B{r) \ W-c = W-c]. (3.2) 

There are two cases. If |x_c + W-d A then x -|- IF and y + W are both in B{r) or both outside 
of B{r), and therefore (3.2) holds as an equality. In the remaining case, the left hand side of (3.2) 
is at least P[IFc = 0] ^ 1 — e, while the right hand side is at most P[IFc 7 b 0] ^ e, so the inequality 
follows by our assumption that e ^ 1 / 2 . □ 

We now show that any small Hamming ball has large expansion. The same result appears earlier 
in [CMN14]. We give a proof below, filling in some missing details. The main idea is to show 
that the Gaussian noise graph is a small-set expander using a hypercontractiviy inequality, and to 
use the central limit theorems to translate this result to reason about the Hamming balls in the 
noisy hypercube graph. This connection between Gaussian noise graphs and noisy hypercubes was 
used commonly in showing integrality gap examples for convex relaxations, and here it is used in 
showing limitations for random walks based algorithms. 

Lemma 13. For any e,r] > 0, there exists 6 = 5c,r) (independent of k) such that for any sufficiently 
large d ^ dc^s,r], all Hamming halls of size ^ 5n has expansion 1 — 0{r]). 

Proof. We will analyze the expansion of a Hamming ball B{r) by relating B{r) to halfspaces 
A^i := {x G M I X ^ r'} in Gaussian probability space. 
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Consider drawing a random edge {x, y) from H according to its weight, and we would like to 
analyze the probability that both vertices are in B[r). The event x G B[r) is the same as |x| = 
l{xi 7 ^ 0) ^ r. Since |x| is a sum of independent random variables and each summand 
has bounded third moment, by Berry-Esseen central limit theorem, for large d, the sum is closely 
approximated by a Gaussian random variable g with the same mean and variance as |x|. That is, 
for all large enough d ^ de,<5,)?) 

P[|x| ^ r] P[g' ^ r] for all r G M (3.3) 

for some 5 ' depending on e, S, rj to be specified later. Here we write C D to mean \C — D\ ^ 5 ' . 

Moreover, multivariate central limit theorem (e.g. [Saz68]) implies that the event (|a:| ^ r)A(|7/| ^ r) 
has roughly the same probability as the event {g ^ r) A {h ^ r), where the bivariate Gaussian {g, h) 
has the same mean and covariance as (|x|, |y|). That is, for large enough d, 

Pr[(|x| < r) A (|y| ^ r)] Kiy Pr[{g ^ r) A (h ^ r)] for all r G M. (3.4) 

We note that the following calculations do not depend on the dimension d other than the GLT 
approximation errors (as we are not concerned about the graph size n = /c'^), so we can choose a 
very large d at the end to make the GLT approximation errors 5 ' to be arbitrarily small for the 
proof to go through. 

Shift g and h to have zero mean and renormalize them to have unit variance. We get g' = 
{g — 'K{g\)/\/'^sx[g\ from g and similarly h' from h. Then g' and h' have covariance 

Gov( ' h') = ^ Gov(l(xi /0),l(j/i /O)) 

^ ’ Y^Var[|x|] Var[|y|] ^yYaT[l{xi / 0)] Var[l(yi / 0)] 

We have Gov(l(xi / 0), l(yi / 0)) = (1 — e) Var[l(xi / 0)], so Cov{g',h') = 1 — e. Therefore, 

P[( 5 r ^ r) A (/i ^ r)] = P[(g'' ^ r') A {h' ^ /)], 

where r' is chosen so that P[ 5 ' ^ r'\ = P[g' ^ r]. 

For standard Gaussians g' and h' with covariance 1 — e, we claim that 

F[h' ^r' \g' r'] ^ F[g' ^ (3.5) 

This inequality follows from Gaussian hypercontractive inequality (e.g. [ODol4, Section 11.1]) 

E (<?')! {h )] ^ II1^.^' II2—e) 

(1 — e) correlated g' ,h' 

where ^ ^0 is the indicator function for the halfspace A^i, and the fact that 

||l^,H|2-e = = ng' ^ 

Set 5 := 7^1"^. Note that for r' small enough so that ¥[g' ^ r'\ ^ <5, then (3.5) implies that the 
halfspace has expansion 

F[h' > r I 5' ^ r'j = 1 - F[h' ^ r' | 5' ^ r'j ^ 1 - Pl^' ^ ^ 1 - 5 "/^ = l-g. 
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We use (3.3) and (3.4) to translate this expansion result from Gaussian space to the noisy hypercube. 
Let 6" ^ (5 be a constant depending on e, 5, rj to be specified later. Any Hamming ball of H of size 
at least 6"n corresponds to a halfspace of roughly the same Gaussian measure via (3.3), and has 
roughly the same noise stability via (3.4). Ghoosing the CLT approximation error 5' := 5"r], we can 
ensure that all Hamming balls B(r) of size between 5"n and 5n have expansion ^ 1 — r/. Indeed, 

P[a: and y € B{r)] ^ P[g'' and h' € Ar'] + 5' 


by (3.4) and 

d ^ P[x G B{r)] ^ ¥[g' G A^/]/2 
by (3.3) and our assumption that P[x G B{r)] ^ 5", so 

1 - (j)iB{r)) = y ^ ^ “*■ 


[P[x G B{r)] 

We now analyze the right hand side. We have 


P[(7' G Ay./] 




by (3.5) and the fact that P[ 5 f' G Ar'] ^ 2P[a: G B{r)] ^ 26. Also 6'/¥[g' G Ar'] ^ 26'/¥[x G B{r)] ^ 
26' 16" = 2g. Therefore (f){B{r)) ^ 1 — 0{g), for those B(r) of size between 6"n and 6n, as required. 

To deal with Hamming balls of size smaller than 6"n, we simply apply the hypercontractive in¬ 
equality on H directly. For any subset B on H (not necessarily a Hamming ball), we have 

P[y G H I X G H] ^ P[x ^ ^jce/iog(i/5) 

for some c > 0. The exponent ce/\og{l/6) is from [Wol07]. Taking 6" := we see that 

^{B) ^ 1 — g whenever |i?| ^ 6"n. □ 


The key point of Lemma 13 is that everything is independent of k. Therefore, given any e, we 
just need to set fc ^ l/d so that H has a set of expansion e and size ^ 6n (by Glaim 2), while 
the evolving set process only explores Hamming balls (by Lemma 12) and all Hamming balls of 
size ^ 6e,rf'n have expansion ^ 1 — e (by Lemma 13). This proves Theorem 4 that the evolving set 
process fails on the A:-ary e-noisy hypercube with probability one. 


Random Walks, Personal Pagerank, and Heat Kernels 

The random walk local graph partitioning algorithm [ST13, ABSIO, KL12] works by computing 
the vector pt ;= A'xv for every vertex u for 1 ^ t ^ O(logn), sorting the vertices so that pt(l) ^ 
Pt{2) ^ ... ^ Pt{n), and trying all the level sets {1,... ,j} for 1 ^ j ^ n. Using the same k- 
ary noisy hypercube example, it is not difficult to see from Lemma 12 that all the level sets that 
the algorithm explored are Hamming balls, and thus the random walk algorithm will also fail to 
disprove the small-set expanson hypothesis. 
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The same argument also applies to the personal pagerank algorithm [ACL06, ZLM13] and the heat 
kernel algorithm, which work by computing some related vectors and trying all the level sets. We 
note that the vectors used by these algorithms are just convex combinations of the random walk 
vectors A^Xv for different t, and therefore all the level sets are Hamming balls, and hence these 
algorithms also fail for the same reason. 

We believe that this example exposes the limitations of all known local graph partitioning algo¬ 
rithms, and can be used as a basis to prove further lower bounds. An interesting question is to 
study whether the analysis of the 0{y/4>{S) log(|5|))-approximation of the evolving set algorithm 
in Theorem 1 is tight when e = l/log(|5|). 
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